On Deriving Indicators from Texts

نویسندگان

  • Steven O. Kimbrough
  • Thomas Y. Lee
  • Ulku Oktem
چکیده

This paper presents and explores the idea of deriving numerical indicators from texts, that is, converting text data to numerical data that has predictive or diagnostic value. One application of such a general capability is to the provisional identification of networks, or rather, of associations within networks. Conversely, given a network structure among entities that are associated with various texts, the network structure can itself contribute usefully to construction of indicators derived from texts. The focus of the paper is on basic concepts and methods for deriving indicators from texts. Much research remains to be done.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Quality Indicators of LSP Texts - Selection and Measurements Measuring the Terminological Usefulness of Documents for an LSP Corpus

This paper describes and evaluates a prototype quality assurance system for LSP corpora. The system will be employed in compiling a corpus of 11 M tokens for various linguistic and terminological purposes. The system utilizes a number of linguistic features as quality indicators. These represent two dimensions of quality, namely readability/formality (e.g. word length and passive constructions)...

متن کامل

Spiritual Health Indicators in Imam Ali's Speeches

Introduction: Spiritual health is explained as the fourth dimension of health based on the cultural and religious contexts and religious thinkers of each society. Therefore, the purpose of this study is to explain the indicators of spiritual health from the perspective of Imam Ali. Method: This descriptive-analytical study is a qualitative content-oriented analysis that collects some of the va...

متن کامل

A generalized framework for deriving nonparametric standardized drought indicators

This paper introduces the Standardized Drought Analysis Toolbox (SDAT) that offers a generalized framework for deriving nonparametric univariate and multivariate standardized indices. Current indicators suffer from deficiencies including temporal inconsistency, and statistical incomparability. Different indicators have varying scales and ranges and their values cannot be compared with each othe...

متن کامل

Deriving Semantic Knowledge from Descriptive Texts Using an MT System

This paper describes the results of a feasibility study which focused on deriving semantic networks from descriptive texts using controlled language. The KANT system 3, 6] was used to analyze input paragraphs, producing sentence-level interlingua representations. The in-terlinguas were merged to construct a paragraph-level representation, which was used to create a semantic network in Conceptua...

متن کامل

Deriving semantic annotations of an audiovisual program from contextual texts

The aim of this paper is to explore whether indexing terms for an audiovisual program can be derived from contextual texts automatically. For this we apply natural-language processing techniques to contextual texts of two Dutch TV-programs. We use a Dutch domain thesaurus to derive and rank the metadata. We evaluate the results by comparing them to human made descriptions.

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2006